Learning Mazes with Aliasing States: An LCS Algorithm with Associative Perception
نویسندگان
چکیده
Maze problems represent a simplified virtual model of real environments that can be used for developing core algorithms of many real-world application related to the problem of navigation. However, the best achievements of Learning Classifier Systems (LCS) in maze problems are still mostly bounded to non-aliasing environments, while LCS complexity seems to obstruct a proper analysis of the reasons of failure. Also, despite the fact that the maze environment problem has a long history of usage in research into learning, there has been little analysis on the complexity of maze problems. To overcome these restrictions we try to improve our understanding of the nature and structure of maze environments. We analyze mazes used in research for the last two decades, introduce a set of maze complexity characteristics and develop a set of new maze environments. We then construct a new LCS agent so that it has a simpler and more transparent performance mechanism, and still could solve mazes better than existing algorithms. We use the structure of a predictive LCS model, strip out the evolutionary mechanism, simplify the reinforcement learning procedure and equip the agent with the ability of associative perception, adopted from psychology. We then run our new LCS with associative perception through the old and new aliasing mazes, which represent partially observable Markov decision problems (POMDP) and demonstrate that it performs at least as well, and in some cases better than other published results. Running Head: Learning Mazes with Aliasing States
منابع مشابه
AgentP Model: Learning Classifier System with Associative Perception
Aliasing environments present the tasks of increased difficulty for Learning Classifier Systems. Aliasing squares look identical for an agent with limited perceptive power, but may demand a completely different optimal strategy. Thus, the presence of aliasing squares in a maze may lead to a non-optimal behaviour and decrease the agent’s performance. As a possible approach to the problem we intr...
متن کاملA Reinforcement Learning Agent with Associative Perception
One of the most perspective ideas of further development of Reinforcement Learning (RL) research involves using associative learning models to improve performance of reinforcement learning agents. Learning Classifier Systems (LCS) have proved to be one of the most successful classes of RL methods that have been applied to maze environments. However, so far LCS have shown their effectiveness for...
متن کاملAnalysing Learning Classifier Systems in Reactive and Non-reactive Robotic Tasks
There are few contributions to robot autonomous navigation applying Learning Classifier Systems (LCS) to date. The primary objective of this work is to analyse the performance of the strength-based LCS and the accuracy-based LCS, named EXtended Learning Classifier System (XCS), when applied to two distinct robotic tasks. The first task is purely reactive, which means that the action to be perfo...
متن کاملبررسی تاثیر تجربههای پیشین یادگیری بر یادگیریهای بعدی
Purpose of the study: This study aimed to investigate the effect of prior learning experienes on later learning(The role of previous experience in the transfer of learning) by comparing Maze learning speed in mice with and without Maze experiences. Methodology: The research project used an experimental design (post-test design with a control group). The study consisted of a sample of 14 rats...
متن کاملAliasing in XCS and the Consecutive State Problem: 1 - Effects
Whilst XCS (Wilson, 1998) has been shown to be more robust and reliable than previous LCS implementations (Kovacs, 1996, 1997), Lanzi (1997) identified a potential problem in the application of XCS to certain simple multi-step non Markovian environments. The 'Aliasing Problem' occurs when the environment provides the same message for two states in environmental positions that generate different...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Adaptive Behaviour
دوره 17 شماره
صفحات -
تاریخ انتشار 2009